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Abstract 

Reversibility is a key issue in tlie interface between computation 
and physics, and of growing importance as miniaturization progresses 
towards its physical limits. Most foundational work on reversible com- 
puting to date has focussed on simulations of low-level machine mod- 
els. By contrast, we develop a more structural approach. We show how 
high-level functional programs can be mapped compositionally (i.e. in 
a syntax-directed fashion) into a simple kind of automata which are 
immediately seen to be reversible. The size of the automaton is lin- 
ear in the size of the functional term. In mathematical terms, we are 
building a concrete model of functional computation. This construc- 
tion stems directly from ideas arising in Geometry of Interaction and 
Linear Logic — but can be understood without any knowledge of these 
topics. In fact, it serves as an excellent introduction to them. At the 
same time, an interesting logical delineation between reversible and 
irreversible forms of computation emerges from our analysis. 



1 Introduction 

The importance of reversibility in computation, for both foundational and, 
in the medium term, for practical reasons, is by now well established. We 
quote from the excellent summary in the introduction to the recent paper 
by Buhrman, Tromp and Vitanyi |19) : 



Reversible Computation: R. Landauer [4T] has demon- 
strated that it is only the "logically irreversible" operations in a 
physical computer that necessarily dissipate energy by generat- 
ing a corresponding amount of entropy for every bit of informa- 
tion that gets irreversibly erased; the logically reversible opera- 
tions can in principle be performed dissipation-free. Currently, 
computations are commonly irreversible, even though the phys- 
ical devices that execute them are fundamentally reversible. At 
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the basic level, however, matter is governed by classical mechan- 
ics and quantum mechanics, which are reversible. This contrast 
is only possible at the cost of efficiency loss by generating ther- 
mal entropy into the environment. With computational device 
technology rapidly approaching the elementary particle level it 
has been argued many times that this effect gains in significance 
to the extent that efficient operation (or operation at all) of fu- 
ture computers requires them to be reversible . . . The mismatch 
of computing organization and reality will express itself in fric- 
tion: computers will dissipate a lot of heat unless their mode of 
operation becomes reversible, possibly quantum mechanical. 

The previous approaches of which we are aware (e.g. [43l [171 [E]) proceed 
by showing that some standard, low-level, irreversible computational model 
such as Turing machines can be simulated by a reversible version of the 
same model. Our approach is more "structural". We firstly define a simple 
model of computation which is directly reversible in a very strong sense — 
every automaton A in our model has a "dual" automaton ^°p, defined quite 
trivially from A, whose computations are exactly the time-reversals of the 
computations of A. We then establish a connection to models of functional 
computation. We will show that our model gives rise to a combinatory 
algebra [33], and derive universality as an easy consequence. This method 
of establishing universality has potential significance for the important issue 
of how to program reversible computations. To quote from [19] again: 

Currently, almost no algorithms and other programs are de- 
signed according to reversible principles ... To write reversible 
programs by hand is unnatural and difficult. The natural way is 
to compile irreversible programs to reversible ones. 

Our approach can be seen as providing a simple, compositional (i.e. "syntax- 
directed" ) compilation from high-level functional programs into a reversible 
model of computation. This offers a novel perspective on reversible comput- 
ing. 

Our approach also has conceptual interest in that our constructions, 
while quite concrete, are based directly on ideas stemming from Linear Logic 
and Geometry of Interaction [25l|26l[^|28l[29lll5l[2ll|22l[l6],and developed 
in previous work by the present author and a number of colleagues O [3l 
El El [71 El Ho]. Our work here can be seen as a concrete manifestation of 
these more abstract and foundational developments. However, no knowledge 
of Linear Logic or Geometry of Interaction is required to read the present 
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paper. In fact, it might serve as an introduction to these topics, from a very 
concrete point of view. At the same time, an interesting logical delineation 
between reversible and irreversible forms of computation emerges from our 
analysis. 

Related Work 

Geometry of Interaction (Gol) was initiated by Girard in a sequence of 
papers [261 1271 128] , extensively developed by Danos, Regnier, Malacaria, 
Baillot, Padicini and others, see e.g. [151 [521 [TB] . I^i particular, Danos 
and Regnier developed a computational view of Gol. In [22] they gave a 
compositional translation of the A-calculus into a form of reversible abstract 
machine. We also note the thesis work of Mackie [H] , done under the present 
author's supervision, which develops a Gol-based implementation paradigm 
for functional programming languages. 

The present paper further develops the connections between Gol as a 
mathematical model of computation, and computational schemes with an 
emphasis on reversibility. As we see it, the main contributions are as follows: 

• Firstly, the approach in the present paper seems particularly simple 
and direct. As already mentioned, we believe it will be accessible 
even without any prior knowledge of Gol or Linear Logic. The basic 
computational formalism is related very directly to standard ideas in 
term-rewriting, automata and combinatory logic. By contrast, much 
of the literature on Gol can seem forbiddingly technical and esoteric 
to outsiders to the field. Thus we hope that this paper may help to 
open up some of the ideas in this field to a wider community. 

• There are also some interesting new perspectives on the standard ideas, 
e.g. the idea of biorthogonal term-rewriting system, and of linear 
combinatory logic (which was introduced by the present author in 

• Prom the point of view of Gol itself, there are also some novelties. 
In particular, we develop the reversible computational structure in a 
syntax-free fashion. We consider a general 'space' of reversible au- 
tomata, and define a linear combinatory algebra structure on this uni- 
verse, rather than pinning all constructions to an induction on a pre- 
conceived syntax. This allows the resulting structure to be revealed 
more clearly, and the definitions and results to be stated more gener- 
ally. 
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We also believe that our descriptions of the linear combinators as au- 
tomata, and of application and replication as constructions on au- 
tomata, give a particularly clear and enlightening perspective on this 
approach to reversible functional computation. 

• The discussion in section 7 of the boundary between reversible and 
irreversible computation, and its relationship to pure vs. applied func- 
tional calculi, and the multiplicative-exponential vs. additive levels of 
Linear Logic, seems of conceptual interest, and is surely worth further 
exploration. 

• The results in section 8 on universality, and the consequent (and some- 
what surprising) non-closure under linear application of finitely de- 
scribable partial involutions, give rise to an interesting, and apparently 
challenging, open problem on the characterization of the realizable 
partial involutions. 

2 The Computational Model 

We formulate our computational model as a kind of automaton with some 
simple term-rewriting capabilities. We assume familiarity with the very 
basic notions of term rewriting, such as may be gleaned from the opening 
pages of any of the standard introductory accounts [23lll0l[Il]. In particular, 
we shall assume familiarity with the notions of signature E = (S„ \ n € uj), 
and of the term algebras Tx; and Ty,{X), of ground terms, and terms in a set 
of variables X, respectively. We will work exclusively with finite signatures 
S. We also assume familiarity with the notion of most general unifier; given 
terms t,u G Ts(X), we write U{t,u)],a ii a : X — > Ty;{X) is the most 
general unifying substitution of t and u, and U{t, u)']' if t and u cannot be 
unified. 

We define a pattern-matching automaton to be a structure 

A = {Q,qL,qf,R) 

where Q is a finite set of states, and qf are distinguished initial and final 
states, and R Q Q x Ty,(X) x Ts{X) x Q is a finite set of transition rules, 
written 

isi,q'i) 
iqN,rN) isN,qN) 
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where qi,qi € Q, ri,Si € Tj]{X), and the variables occurring in Sj are a 
subset of those occurring in r^, 1 < i < A^. It is also convenient to assume 
that no variable appears in more than rule. We also stipulate that there 
are no incoming transitions to the initial state, and no outgoing transitions 
from the final state: q^, ^ q[ and qf ^ qi, I < i < N. 

A configuration of .4 is a pair {q, t) £ Q x of a state and a ground term. 
A induces a relation on configurations: {q,t) {q',t') iff 

3i{qi = q A ql = q' A U{t,ri)ia A t' = cr(si)). 

Note that the "pattern" has to match the whole of the term t. This is akin 
to the use of pattern-matching in functional programming languages such as 
SML |46] and Haskell [l9] , and is the reason for our choice of terminology. 

Note that the cost of computing the transition relation {q, t) {q', t') 
is independent of the size of the "input" term t0 If we are working with a 
fixed pattern-matching automaton A, this means that the basic computation 
steps can be performed in constant time and space, indicating that our 
computational model is at a reasonable level of granularity. 

A computation over A starting with an initial ground term to ^ (the 
input) is a sequence 

{qi,to) {qi,h) 

The computation is successful if it terminates in a configuration {qf,tk), in 
which case t^ is the output. Thus we can see a pattern-matching automaton 
as a device for computing relations on ground terms. 
We say that a pattern-matching automaton 

A = {Q,q„qf,R) 

with 

R={{q^,r,)^{si,q'i) I 1 < ^ < iV} 
is orthogonal if the following conditions hold: 

Non-ambiguity For each i < i < j < N , if qi = qj, then U{ri,rj)^. 

Left-linearity For each z, 1 < i < A^, no variable occurs more than once 
in rj. 

^Under the assumption of left-linearity (see below) which we shall shortly make, and 
on the standard assumption made in the algorithmics of unification [141 I23| that the 
immediate sub-terms of a given term can be accessed in constant time. 
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Note that non-ambiguity is stated in a simpler form than the standard ver- 
sion for term-rewriting systems [14:\ [23l 140] . taking advantage of the fact 
that we are deahng with the simple case of pattern-matching. 

Clearly the effect of non-ambiguity is that computation is deterministic: 
given a configuration {q,t), at most one transition rule is applicable, so that 

the relation is a partial function. 

Given a pattern matching automaton A as above, we define A°'^ to be 

(g,g/,(?„i?°p) 

where 

= {{qls,)^{n,q,)\l<i<N} 

We define A to be biorthogonal if both A and are orthogonal pattern- 
matching automata. Note that if is a biorthogonal automaton, so is A°'^ , 
and ^°P°P = A. 

It should be clear that computation in biorthogonal automata is re- 
versible in a deterministic, step-by-step fashion. Thus if we have the com- 
putation 

{Qi^to) ^ ■■■ ^ iqf,tn) 
in the biorthogonal automaton A, then we have the computation 

{qf,tn) — > ■■■ — > {qL,to) 

in the biorthogonal automaton A°'^. Note also that biorthogonal automata 
are linear in the sense that, for each rule (g, r) (s, q'), the same variables 
occur in r and in s, and moreover each variable which occurs does so exactly 
once in r and exactly once in s. Thus there is no "duplicating" or "discard- 
ing" of sub-terms matched to variables in applying a rule, whether in A or 
in A°P. 

Orthogonality is a very standard and important condition in term-rewriting 
systems. However, biorthogonality is a much stronger constraint, and very 
few of the term-rewriting systems usually considered satisfy this condition. 
(In fact, the only familiar examples of biorthogonal rewriting systems seem 
to be associative/commutative rewriting and similar, and these are usually 
considered as notions for "rewriting modulo" rather than as computational 
rewriting systems in their own right). 

Our model of computation will be the class of biorthogonal pattern- 
matching automata; from now on, these will be the only automata we shall 
consider, and we will refer to them simply as "automata". The reader will 
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surely agree that this computational model is quite simple, and seen to be 
reversible in a very direct and immediate fashion. We will now turn to the 
task of establishing its universality. 

Remark It would have been possible to represent our computational model 
more or less entirely in terms of standard notions of term rewriting systems. 
We briefly sketch how this might be done. Given an automaton 

A = {Q,q„qf,R) 

we expand the (one-sorted) signature S to a signature over three sorts: V 
(for values), S (for states) and C (for configurations). The operation symbols 
in S have all their arguments and results of sort V; for each state q ^ Q, 
there is a corresponding constant of sort S; and there is a binary operation 

{■,■)■■ S xV ^ C. 

Now the transition rules R turn into a rewriting system in the standard 
sense; and orthogonality has its standard meaning. We would still need to 
focus on initial terms of the form ((7^, t) and normal forms of the form {qf, t), 
t ground. 

Our main reason for using the automaton formulation is that it does 
expose some salient structure, which will be helpful in defining and under- 
standing the significance of the constructions to follow. 

3 Background on Combinatory Logic 

In this section, we briefly review some basic material. For further details, 
see [33] . 

We recall that combinatory logic is the algebraic theory CL given by the 
signature with one binary operation (application) written as an infix _ • _, 
and two constants S and K, subject to the equations 

K. ■ X ■ y = X 

S ■ X ■ y ■ z = X ■ z ■ {y ■ z) 

(application associates to the left, so x ■ y ■ z = {x ■ y) ■ z). Note that we can 
define I = S • K • K, and verify that \ - x = x. 

The key fact about the combinators is that they are functionally com- 
plete^ i.e. they can simulate the effect of A-abstraction. Specifically, we can 
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define bracket abstraction on terms in Tc'l{X): 

\*x.M = K-M (x0FV(M)) 
\*x.x = I 

\*x.M-N = S ■ {\*x.M) ■ {X*x.N) 

Moreover (Theorem 2.15 in [33]): 

CL h (A*x. M)-N = M{N/x\. 

The B combinator can be defined by bracket abstraction from its defining 
equation: 

'R ■ x ■ y ■ z = x ■ {y ■ z). 
The combinatory Church numerals are then defined by 

n = (S • B)" • (K • I) 

where we define 

a" ■ 6 = a • (a • • • (a • 6) • • •). 

A partial function : N ^ N is numeralwise represented by a combinatory 
term M G Tcl if for ah n € N, if (j){n) is defined and equal to m, then 

CL \- M ■n = m 

and if (pin) is undefined, then M ■ n has no normal form. 

The basic result on computational universality of CL is then the following 

(Theorem 4.18 in [33]): 

Theorem 3.1 The partial functions numeralwise representable in CL are 
exactly the partial recursive functions. 

4 Linear Combinatory Logic 

We shall now present another system of combinatory logic: Linear Combina- 
tory Logic [HI \T0[ [7]. This can be seen as a finer-grained system into which 
standard combinatory logic, as presented in the previous section, can be 
interpreted. By exposing some finer structure. Linear Combinatory Logic 
offers a more accessible and insightful path towards our goal of mapping 
functional computation into our simple model of reversible computation. 

Linear Combinatory Logic can be seen as the combinatory analogue of 
Linear Logic |25j; the interpretation of standard Combinatory Logic into 
Linear Combinatory Logic corresponds to the interpretation of Intuitionistic 
Logic into Linear Logic. Note, however, that the combinatory systems we 
are considering are type- free and "logic- free" {i.e. purely equational). 
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Definition 4.1 A Linear Combinatory Algebra {A,-, !) consists of the fol- 
lowing data: 



• An applicative structure {A, •) 

• A unary operator \ : A ^ A 

• Distinguished elements B, C, I, K, D, 6, F, W of A 

satisfying the following identities (we associate ■ to the left and write x- \y 



for X- (!(?/)), 


etc.) for all 


variables x, y, 


z ranging over A. 


1. 


'B ■ X ■ y ■ z 


= X- {y- z) 


Composition/Cut 


2. 


C ■ X ■ y • z 


= {x ■ z) -y 


Exchange 


3. 


Ix 


= X 


Identity 


4. 


K • X • !y 


= X 


Weakening 


5. 


D • \x 


= X 


Dereliction 


6. 


6 ■ \x 


= \\x 


Comultiplication 


7. 


F • !x • !y 


= ^-{x-y) 


Monoidal Functoriality 


8. 


W ■ X - ly 


= x-\y-\y 


Contraction 



The notion of LCA corresponds to a Hilbert style axiomatization of the {!, ^ 
} fragment of linear logic [HI [T3| [51] . The principal types of the combinators 
correspond to the axiom schemes which they name. They can be computed 
by a Hindley-Milner style algorithm [34] from the above equations: 

1. B : (/? ^ 7) ^ (a ^ /3) ^ a ^ 7 

2. C : (a^/3^7)^(/3^a^7) 

3. I : a ^ a 

4. K : a^lp^a 

5. D : \a —o a 
6.5 : !a ^ ! !q 

7. F : !(a ^ /3) ^!/3 

8. W : (!a ^ /3) ^ /3 

Here — o is a linear function type (linearity means that the argument is used 
exactly once), and \a allows arbitrary copying of an object of type a. 

A Standard Combinatory Algebra consists of a pair [A, -g) where yl is a 
nonempty set and -g is a binary operation on A, together with distinguished 
elements Bs,Cs,Is,Ks, and of A, satisfying the following identities for 
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all X, y, z ranging over A: 



1. Bs-sX-sV-sZ = x-s{y-sz) 

2. Cs ■ s X ■ s y • s z = {x -g z) -g y 



A. 'Ks-sX-sy = X 

5. Ws-sx-sy = x-sy-sV 



Note that this is equivalent to the more familiar definition of SK-combinatory 
algebra as given in the previous section. In particular, can be defined from 
Bs, Cs, Is and [151 [M]. Let {A,-, !) be a linear combinatory algebra. 
We define a binary operation on A as follows: for a, 6 G j4, a -sh = a-\h. 
We define D' = C • (B • B • I) • (B • D • I). Note that 



Now consider the following elements of A. 

1. B, = C-(B-(B-B-B)-(D'-I))-(C-((B-B)-F)-5) 

2. C, = D' C 

3. = D' I 

4. = D' K 

5. = D' W 

Theorem 4.1 Let (j4, •, !) he a linear combinatory algebra. Then {A,-s) 
with -s and the elements Bg, Cs, Is, K^, as defined above is a standard 
combinatory algebra. 

Finally, we mention a special case which will arise in our reversible model. 
An Affine Combinatory Algebra is a Linear Combinatory Algebra such that 
the K combinator satisfies the stronger equation 



Note that in this case we can define the identity combinator: I = C • K ■ K. 

5 The Affine Combinatory Algebras I and V 

We fix the following signature S for the remainder of this paper. 



Y)' -x-ly = 



X ■ y. 



K. ■ X ■ y = X. 



So 
Si 

S2 



{e} 

{l,r} 

{p} 







n>2. 
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We shall discuss minimal requirements on the signature in Section 6.4. 
We write X for the set of all partial injective functions on T^. 



5.1 Operations on X 
5.1.1 Replication 



5.1.2 Linear Application 



LApp(/,5) = frr^ fri;9;{fii;gT;fir 

where 

fij = I {i{u),j{v)) e /} e {l,r}) 

and we use the operations of relational algebra (union, composition, and 
reflexive, transitive closure). 

The idea is that terms of the form r{t) correspond to interactions be- 
tween the functional process represented by / and its environment, while 
terms of the form l(t) correspond to interactions with its argument, namely 
the functional process represented by g. This is linear application because 
the function interacts with one copy of its argument, whose state changes as 
the function interacts with it; "fresh" copies of the argument are not neces- 
sarily available as the computation proceeds. The purpose of the replication 
operation described previously is precisely to make the argument copyable, 
using the first argument of the constructor p to "tag" different copies. 

The "flow of control" in linear application is indicated by the following 
diagram: 



in 



fr 



out 



rl 



fir 



fll 

Thus the function / will either respond immediately to a request from the 

environment without consulting its argument (/rr), or it will send a "mes- 
sage" to its argument (fri), which initiates a dialogue between / and g {fu 
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and g), which ends with / despatching a response to the environment {fir)- 
This protocol is mediated by the top-level constructors I and r, which are 
used (and consumed) by the operation of Linear Application. 

5.2 Partial Involutions 

Note that / G Z =^ /°p G I, where /°p is the relational converse of /. We 
say that / G X is a partial involution if /°p = /. We write V for the set of 
partial involutions. 

Proposition 5.1 Partial involutions are closed under replication and linear 
application. 

Proof It is immediate that partial involutions are closed under replication. 
Suppose that / and g are partial involutions, and that LApp{f, g){u) = v. 
We must show that LApp{f, g){v) = u. There are two cases. 
Case 1: f{r{u)) = r{v), in which case f{r{v)) = r{u), and LApp{f, g){v) = u 
as required. 

Case 2: for some wi, . . . , Wk, k >0, 

/(r(n)) = l{wi),g{wi) = W2,f{l{w2)) = I{w3),g{w3) = W4, ■ ■ .,f{l{wk)) = l{wk+i), 

g{wk+i) = Wk+2,f{Kwk+2) = r(v). 

Since / and g are involutions, this implies 

f(r{v)) = l{wk+2),g{wk+2) =Wk+l,f{l{wk+l)) =l{wk),...,g{wi) =W3,f{l{w3)) = 1{W2), 

g{w2) = wi,f{l{wi) = r{u), 
and hence LApp{f,g){v) = u as required. □ 

5.3 Realizing the linear combinators by partial involutions 

A partial injective map / G X is finitely describable if there is a family 

{{ti,Ui) \ l<i<k} 

where ti,Ui G Tx;(X), such that the graph of / is the symmetric closure of 

{{a{ti),a{ui)) \a:X^Tj:,l<i<k}. 

Here a : X — > Ts ranges over ground substitutions. 

We write t -tr^ u when (t, u) is in the finite description of a partial invo- 
lution, and refer to such expressions as rules. 
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5.3.1 The identity combinator I 

As a first, very simple case, consider the identity combinator I, with the 
defining equation 

I - a = a. 

We can picture the I combinator, which should evidently be applied to one 
argument to achieve its intended effect, thus: 




out 



Here the tree represents the way the applicative structure is encoded into 
the constructors I, r, as reflected in the definition of LApp. Thus when I 
is applied to an argument a, the /-branch will be connected to a, while the 
r-branch will be connected to the output. The equation I-a = a means that 
wc shoidd have the same informMion at the leaves a and out of the tree. 
This can be achieved by the rule 

I : l{x) -H- r{x) 

and this yields the definition of the automaton for I. 

Now we can show that for any automaton A representing an argument 
a we indeed have 

/lapp(^i,.A) = fA = a. 

Indeed, for any input t 

r{t) ^ l{t) t u r{u) ^ l{u) 



LApp{^i,^) 
t I > U 

5.3.2 The constant combinator K 

Now we consider the combinator K, with the defining equation, with defining 
equation 

K. ■ a ■ b = a. 

We have the tree diagram 
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K 




in2 out 



The defining equation means that we need to make the information at out 
equalt to that at Gi. This can be accomphshed by the rule 

K : l{x) -H- r{r{x)). 

Note that the second argument (€2) does not get accessed by this rule. 

5.3.3 The bracketing combinator B 

We now turn to a more complex example, the 'bracketing' combinator B, 
with the defining equation 



B ■ a ■ 6 ■ c = a • (6 • c). 
B 




in^ out^ c out 

Here, the arguments a and b themselves have some applicative structure 
used in the defining equation: a is applied to the rsult of applying b to c. 
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This means that the automaton reahzing B must access the argument and 
result positions of a and b, as shown in the tree diagram. 

This requires the output out of B to be connected to the output out" of 
a. This translates into the following rule: 

r(r(r(x))) l{r{x)). 

Similarly, the output out* of b must be connected to in", leading to the rule: 

l{l{x))^rmx))). 

Finally, c must be connected to in'', leading to the rule: 

r(/(/(x)) ^ r{r{l{x))). 

5.3.4 The commutation combinator C 

The C combinator can be analyzed in a similar fashion. The defining equa- 
tion is 

C-a-b-c = a- c-b. 

We have the tree diagram 



C 




in2 out" c out 



We need to connect b to in2, c to in^, (this inversion of the left-to-right 
ordering corresponds to the commutative character of this combinator) , and 
out to out". We obtain the following set of rules: 

mx)) ^ r{r{l{x))) 

Rc : l{r{l{x))) o r{l{x))) 

l{r{r{x))) -f-)- r{r{r{x))) 
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Note at this point that hnear combinatory completeness already yields 
something rather striking in these terms; that all patterns of accessing ar- 
guments and results, with arbitrarily nested applicative stucture, can be 
generated by just the above combinators under linear application. 

Note that at the multiplicative level, we only need unary operators in 
the term algebra. To deal with the exponential !, a binary constructor is 
needed. 



5.3.5 The dereliction combinator D 

We start with the dereliction combinator D, with defining equation 

D- \a = a. 

Notice that the combinator expects an argument of a certain form, namely 
!a (and the equational rule will only "fire" if it has that form). 
We have the tree 

D 




out 

We need to connect the output to one copy of the input. We use the constant 
e to pick out this copy, and obtain the rule: 

l(j>{e, x)) -H- r{x). 



5.3.6 The comultiplication combinator 6 

For the comultiplication operator, we have the equation 

6-\a = \\a 

and the tree 

S 




\a ! lout 
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Note that a typical pattern at the output will have the form 

r{p{x,p{y,z))) 
while a typical pattern at the input has the form 

lipix',y')). 

The combinator cannot control the shape of the sub-term at y' , so wc can- 
not simply unify the two patterns. However, because of the nature of the 
replication operator, we can impose whatever structure we like on the 'copy 
tag' x', in the knowledge that this will not be changed by the argument 
\a which the combinator will be applied to Hence we can match these two 
patterns up, using the fact that the term algebra Ts allows arbitrary nesting 
of constructors, so that we can write a pattern for the input as 

l{p{p{x,y),z)). 

Thus we obtain the rule 

l{p{p{x,y),z)) ^ r{p{x,p{y,z))). 

Note that this rule embodies an "associativity isomorphism for pairing", 
although of course in the free term algebra Ts the constructor p is certainly 
not associative. 

5.3.7 The functional distribution combinator F 

The combinator F with equation 



F-!a- \b = \{a-b). 
F 




in" out" \b out 
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F expresses 'closed functoriality' of ! with respect to the hnear horn —o. 
Concretely, we must move the application of a to 6 inside the !, which is 
achieved by commuting the constructors I, r and p. Thus we connect out" 
to out: 

l{p{x,r{y))) o r{r{p{x,y))) 

and in" to lb: 

l{p{x,l{y))) ^ r{l{p{x,y))). 

5.3.8 The duplication combinator W 

Finally, we consider the duplication combinator W: 

W ■a-\b = a-\b-\b. 
W 




lini out" 



We must connect out and out": 

r{r{x)) -H- l{r{r{x))). 

We also need to connect \b both to in" and to in2. We do this by using the 
copy-tag field of lb to split its address space into two, using the constructors 
I and r. This tag tells us whether a given copy of lb should be connected to 
the first (l) or second (r) input of a. Thus we obtain the rules: 

l{l{p{x,y))) ^ r{l{p{l{x),y))) 
l{r{l{jp{x,y)))) ^ r{l(jp{r{x),y))) 

Once again, combinatory completeness tells us that from this limited stock 
of combinators, all definable patterns of application can be expressed; more- 
over, we have a universal model of computation. 
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5.4 The afRne combinatory algebras X and V 

Theorem 5.1 (X, •, !, /b, /c, /k, /d, /5, /f, /w) is an affine combinatory 
algebra, with subalgebra V. 

This theorem is a variation on the resuhs established in O [6l [9l [71 llOj : 
see in particular |10( Propositions 4.2, 5.2], and the combinatory algebra of 
partial involutions studied in [7]. The ideas on which this construction is 
based stem from Linear Logic \25\ [29] and Geometry of Interaction [26| j^T] , 
in the form developed by the present author and a number of colleagues 
[2l[l[5l[6l[9l[7l[IO]. 

Once again, combinatory completeness tells us that from this limited 
stock of combinators, all definable patterns of application can be expressed; 
moreover, we have a universal model of computation. 

6 Automatic Combinators 

As we have already seen, a pattern-matching automaton A can be seen as 
a device for computing a relation on ground terms. The relation i?_4 C 
Xe X Xe is the set of all pairs {t,t') such that there is a computation 

{q,,t) A* {qf,t'). 

In the case of a biorthogonal automaton A, the relation i?_4 is in fact a 
partial injective function, which we write Note that f_/[op = f^, the 
converse of f_/[, which is also a partial injective function. In the previous 
section, we defined a linear combinatory algebra V based on the set of partial 
involutions on X^;. We now want to define a subalgebra of V consisting 
of those partial involutions "realized" or "implemented" by a biorthogonal 
automaton. We refer to such combinators as "Automatic" , by analogy with 
Automatic groups [24], structures |38| and sequences 

6.1 Operations on Automata 
6.1.1 Rephcation 

Given an automaton A = {Q , q^, qf , R) , let x he a variable not appearing in 
any rule in R. We define 

\A={Q,q,,qf, \R) 
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A = {Q,q,,qf,R) B = {P,p„pf,S) 

LApp(A^) = {Q^P,q,,qf,T) 
T= U Rjk ^ S 

j,ke{l,r,i} 

Q'"' = Q\{qc,qf} 

Rrr = {{qc, u) {v, qf) I {qt, r{u)) {r{v), qf) G R} 

Rri = {iq„u) ^ {v,p,) \ {q„r{u)) ^ {l{v),qf) e R} 

Rll = {{Pf^u) {v,p,) I {q„l{u)) {l{v),qf) £ R} 

Rlr = {{Pf,u) {v,qf) I {qi,l{u)) {r{v),qf) G R} 

Rii = {{q,u)^ {v,q')eR\q,q' eQ'"'} 

Rri = {{q,, u) ^ {v, q) I {q„ r{u)) {v, q) e R,q € Q'"*} 

Rli = {ipf,u) {v, q) I {q„ l{u)) (v, q) e R,q e Q'"'} 

Ril = {{q,u)^{v,p,)\iq,u)^iliv),qf)£R,qGQ'"'} 

Rir = {{q, u) {v, qf) I {q, u) {r{v), qf) e R,q E Q'"^} 

Figure 1: Linear Application 

where \R is defined as 

{iq,p{x,r)) {p{x,s),q') \ {q,r) {s,q') G R}. 

Note that the condition on x is necessary to ensure the hnearity of \R. The 
biorthogonahty of \A is easily verified. 

6.1.2 Linear Application 

See Figure 1. Here Q^P is the disjoint union of Q and P (we simply assume 
that Q and P have been relabelled if necessary to be disjoint). 
The key result we need is the following. 

Proposition 6.1 (i) !/^ = /i^. 
(ii) LApp(/^,/s) = /lapp(ab)- 

Proof (i) = p{t,v) iff /^(n) = v iS u ^* v iS p{t,u) ^* 

pit,v). 

(ii) Let C = LApp(^,^). Suppose LApp(/^, fi3){t) = u. Then either /„.(t) = 

u, or fri{t) = V, g{v) = wi, fu{wi) = W2, g{w2) = W3, fu{wk) = Wk+i, 

A * 

g{wk+i) = Wk+2, fir{wk+2) = u. In the first case, {q„r{t)) — > {qf,r{u)), 
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and hence — > (qf^u). In the latter case, {qi^^r{t)) — > {qf^l{v))^ 

{p„v) — y {pf,wi), {q„l{wi)) — > {qf,l{w2))AViiW2) — > {pj^ws),..., 

iqi,l{wk)) — > {qf,l{wk+i)), {p„Wk+i) — > {pf,Wk+2), iq„l{wk+2)) — > 

{Qf,r{u)), and hence again {q„t) {qf,u). Thus LApp(/^,/^) C /lapp(AB)- 
The converse inclusion is proved similarly. □ 

6.2 Finitely describable partial involutions are Automatic 

Now suppose we are given a finite description of a partial involution /. 
We define a corresponding automaton A: 

= {{Qi.,qf},qi.,qf,R) 

where 

^ = U {(Q^^t) ^ iu,qf), {q^,u) ^ {t,qf)}. 

{t,u)es 

It is immediate that = /• 

Note that A has no internal states, and all its rules are of the above 
special form. These features are typical of the automata corresponding to 
normal forms in our interpretation of functional computation. 

6.3 The Automatic Universe 

The results of the previous two sections yield the following Theorem as an 
immediate consequence. 

Theorem 6.1 TZ is an affine combinatory sub-algebra of I, where the car- 
rier of TZ is the set of all /_4 for hiorthogonal automata A. Moreover, 
S = V DTZ is an affine combinatory sub-algebra ofTZ. 

Thus we obtain a subalgebra S of TZ, of partial involutions realized by 
biorthgonal automata; and even these very simple behaviours are computa- 
tionally universal. Partial involutions can be seen as "copy-cat strategies" 

6.4 Minimal requirements on S 

We now pause briefly to consider our choice of the particular signature S. 
We could in fact eliminate the unary operators / and r in favour of two 
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constants, say a and b, and use the representation 

l{t) = p{a,t) 
r{t) = p{b,t) 
p{t,u) = p{e,p{t,u)). 

We can in turn eliminate a and b, e.g. by the definitions 

a = p{e,e) b = p{p{e,e),e). 

So one binary operation and one constant — i.e. the pure theory of binary 
trees — would suffice. 

On the other hand, if our signature only contains unary operators and 
constants, then pattern-matching automata can be simulated by ordinary 
automata with one stack, and hence are not computationally universal [47j . 

This restricted situation is still of interest. It suffices to interpret BCK- 
algebras, and hence the affine A-calculus Recall that the B and C 

combinators have the defining equations 

B ■ X ■ y ■ z = X • {y ■ z) 
C-x-y-z = x- z- y 

and that BCK-algebras admit bracket abstraction for the affine A-calculus, 
which is subject to the constraint that applications M-N can only be formed 
if no variable occurs free in both M and A^. The affine A-calculus is strongly 
normalizing in a number of steps linear in the size of the initial term, since 
/3-reduction strictly decreases the size of the term. 

We build a BCK-algebra over automata by using Linear instead of stan- 
dard application, and defining automata for the combinators B, C and K 
without using the binary operation symbol p. For reference, we give the set 
of transition rules for each of these automata: 
i?K (linear version): 

r(r(x)) o l{x) 

Rb-- 



l{r{x)) 




r(r(r(x))) 


mx)) 




r(/(r(x))) 


r(/(/(x))) 




r(r(Z(x))) 


mx)) 




r{r{l{x))) 


l{r{l{x))) 




r{l{x))) 


/(r(r(x))) 




r(r(r(x))) 



Note that, since only unary operators appear in the signature, these au- 
tomata can be seen as performing prefix string rewriting [39j . 
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7 Compiling functional programs into reversible 
computations 



Recall that the pure A-calculus is rich enough to represent data-types such 
as integers, booleans, pairs, lists, trees, and general inductive types [3^; 
and control structures including recursion, higher-order functions, and con- 
tinuations [50|. A representation of database query languages in the pure 
A-calculus is developed in [32]. The A-calculus can be compiled into combi- 
nators, and in fact this has been extensively studied as an implementation 
technique [H]. Although combinatory weak reduction does not capture all 
of /9-reduction, it suffices to capture computation over "concrete" data types 
such as integers, lists etc., as shown e.g. by Theorem 13.11 Also, combinator 
algebras form the basic ingredient for realizability constructions, which are a 
powerful tool for building models of very expressive type theories (for text- 
book presentations see e.g. [121 120]). By our results in the previous section, 
a combinator program M can be compiled in a syntax-directed fashion into 
a biorthogonal automaton A. Moreover, note that the size of A is linear in 
that of M. 

It remains to specify how we can use A to "read out" the result of the 
computation of M. What should be borne in mind is that the automaton 
A is giving a description of the behaviour of the functional process cor- 
responding to the program it has been compiled from. It is not the case 
that the terms in Ts input to and output from the computations of A cor- 
respond directly to the inputs and outputs of the functional computation. 
Rather, the input also has to be compiled as part of the functional term to 
be evaluated — this is standard in functional programming generally!! The 
automaton resulting from compiling the program together with its input can 
then be used to deduce the value of the output, provided that the output is 
a concrete value. 

We will focus on boolean-valued computations, in which the result of the 
computation is either true or false, which we represent by the combinatory 
expressions K and K • I respectively. By virtue of the standard results on 
combinatory computability such as Theorem 13. H for any (total) recursive 
predicate P, there is a closed combinator expression M such that, for all n, 
P{n) holds if and only if 

CLhM-n = K, 

^However, note that, by compositionality, the program can be compiled once and for 
aU into an automaton, and then each input value can be compiled and "linked in" as 
required. 
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and otherwise CL h M • n = K • I. Let the automaton obtained from the 
term M • n be Then by Theorem 16. H fj\^ = /k or /_4 = /k i- Thus to 
test whether P{n) holds, we run A on the input term r(r(e)). If we obtain 
a result of the form l{u), then -P(n) holds, while if we obtain a result of the 
form r{v), it does not. Moreover, this generalizes immediately to predicates 
on tuples, lists, trees etc., as already explained. 

More generally, for computations in which e.g. an integer is returned, 
we can run a sequence of computations on the automaton A, to determine 
which value it represents. Concretely, for Church numerals, the sequence 
would look like this. Firstly, we run the automaton on the input r(r(e)). 
If the output has the form r{l{u)) (so that the term is 'A/. Ax. x') then the 
result is 0. Otherwise, it must have the form l{p{u, r{v))) (so it is of the form 
A/. Ax. / . . ., i.e. it is the successor of . . . ), and then we run the automaton 
again on the input term l{p{u,l{p{e,v))). If we now get a response of the 
form r{l{u)), then the result is the successor of 0, i.e. 1 (!!). Otherwise . . . 

In effect, we are performing a meta-computation (which prima facie is 
irreversible), each "step" of which is a reversible computation, to read out 
the output. It could be argued that something analogous to this always 
happens in an implementation of a functional programming language, where 
at the last step the result of the computation has to be converted into human- 
readable output, and the side-effect of placing it on an output device has to 
be achieved. 

This aspect of recovering the output deserves further attention, and we 
hope to study it in more detail in the future. 

Pure vs. Applied A-calculus 

Our discussion has been based on using the pure A-calculus or CL, with no 
constants and (5-rules [33l [ll]- Thus integers, booleans etc. are all to be 
represented as A-terms. The fact that A-calculus and Combinatory Logic 
can be used to represent data as well as control is an important facet of 
their universality; but in the usual practice of functional programming, this 
facility is not used, and applied A-calculi are used instead. It is important to 
note that this option is not open to us if we wish to retain reversibility. Thus 
if we extend the A-calculus with e.g. constants for the boolean values and 
conditional, and the usual (5-rules, then although we could continue to in- 
terpret terms by orthogonal pattern-matching automata, biorthogonality — 
i.e. reversibility — would be lost. This can be stated more fundamentally 
in terms of Linear Logic: while the multiplicative-exponential fragment of 
Linear Logic (within which the A-calculus lives) can be interpreted in a 
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perfectly reversible fashion (possibly with the loss of soundness of some 
conversion rules [26\ llOj). this fails for the additives. This is reflected for- 
mally in the fact that in the passage from modelling the pure A-calculus, 
or Multiplicative-Exponential Linear Logic, to modelling PCF, the prop- 
erty of partial injectivity of the functions /_4 (the "history- free strategies" 
in [3l [8]) is lost, and non-injective partial functions must be used [3l 13 31]. 
It appears that this gives a rather fundamental delineation of the boundary 
between reversible and irreversible computation in logical terms. This is 
also reflected in the denotational semantics of the A-calculus: for the pure 
calculus, complete lattices arise naturally as the canonical models (formally, 
the property of being a lattice is preserved by constructions such as func- 
tion space, lifting, and inverse limit), while when constants are added, to be 
modelled by sums, inconsistency arises and the natural models are cpo's [1]. 
This suggests that the pure A-calculus itself provides the ultimate reversible 
simulation of the irreversible phenomena of computation. 

8 Universality 

A minor variation of the ideas of the previous section suffices to establish 
universality of our computational model. Let be a recursively enumerable 
set. There is a closed combinatory term M such that, for all n G N, 

new CL h M • n = 

and \i n ^ W then M ■ n does not have a normal form. Let A be the 
automaton compiled from M ■ n. Then we have a reduction of membership 
in W to the question of whether A produces an output in response to the 
input r(r(e)). As an immediate consequence, we have the following result. 

Theorem 8.1 Termination in hiorthogonal automata is undecidable; in fact, 
it is 'El-complete. 

As a simple corollary, we derive the following result. 

Proposition 8.1 Finitely describable partial involutions are not closed un- 
der linear application. 

Proof The linear combinators are all interpreted by finitely describable 
partial involutions, and it is clear that replication preserves finite describabil- 
ity. Hence if linear application also preserved finite describability, all com- 
binator terms would denote finitely describable partial involutions. How- 
ever, this would contradict the previous Theorem, since termination for a 
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finitely describable partial involution reduces to a finite number of instances 
of pattern-matching, and hence is decidable. □ 
This leads to the following: 

Open Question: Characterize those partial involutions in 

iS, or alternatively, those which arise as denotations of combina- 
tor terms. 
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